NCB1 Blast:Protein sequence(12 letters) Page 1 of 1 

SEGlvEkjlC ComPfiXisoAJ 4 of t/njii pi 

>gb I EAW85671.il B BAIl-associated protein 3, isoform CRA b [Homo sapiens] 
Length-1169 

Score = 24.8 bits (51), Expect = 586 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 ELLRKKV 8 

ELLRKKV 
Sbjct 764 ELLRKKV 770 



http://www.ncbi.nlm.nih.gov/BLAST/Blast.cgi 



6/12/2007 



NCBI Blast:Protein sequence(263 letters) Rage 23 uf 36 

Query 65 KDVV DITDELRRRLRGAKGKSVCKICK-KIMLEIAVSKASVVATGECGMDT 114 

K +V +1 D + + + C IC KI + + +A + G D 

Sb j ct 62 KHIVYDMRETFKTEI IDYFVEEYKQGRTPNPCFICNSKIKFGLFLEQAL KDGFDK 116 

• Query 115 IAGAVLDVSRRTGIEPEFVQLPKRYFNGDDRIIVR 14 ! 

IA TG + ++ K GD+R ++R 
Sbjct 117 IA TG HYAKIEKTEIEGDERFLLR i: 



>ref |ZP_01778173.1| tRNA ( 5-methylaminomethyl-2-tfhiouridylate ) -methyltransf erase 
[Petrotoga mobilis SJ95] 

gb|EDJ82392. 1| tRNA ( 5-methylaminomethyl-2-thirfuridylate ) -methyltransf erase 
[Petrotoga mobilis SJ95] 
Length=367 

Score = 37.7 bits (86), Expect = 0.70, Methodf: Composition-based stats. 
Identities - 37/131 (28%), Positives = 61/131/(46%), Gaps = 30/131 (22%) 

VSGKKICVLISGGMDSAVATKILQLSNTDVRGLIf ITHRWMWFTPEI 53 

+S KK+ +L+SGG+DS+VA +L+ N V GLM ++ PE 

MSNKKVLMLMSGGVDSSVAAYLLKEQNYHVIGlilFKTVSDVVFSLIPEKKKVCCSPSDTQ 60 

EIKRISKMLGIKD— VVDITDELRRRL /--RGAKGKSVCKICKKIM LEIA 98 

+ +1+ L + D +VDI E + + + / +K+C+C+ LEIA 

DALKIADKLDLDDFQIVDIKKEFKEKIIDYFIKTYKEGKTPNPCMLCNRFFKFGKALEIA 120 

VS-KASVVATG 108 

S A V++G 
HSYGADWVSSG 131 



Query 


8 


Sbjct 


1 


Query 


54 


Sbjct 


61 


Query 


99 


Sbjct 


121 



>ref | ZP_00144111 . 1 | tRNA ( 5-methylaminorrfethyl-2-thiouridylate) -methyltransf eras 
[Fusobacterium nucleatum subsp. vincentii/ ATCC 49256] 

gb|EAA24275.1| tRNA ( 5-methylaminomethil-2-thiouridylate ) -methyltransf erase 
[Fusobacterium nucleatum subsp. vincentii ATCC 49256] 
Length=362 

Score = 37.4 bits (85), Expect = 0.85/ Method: Composition-based stats. 
Identities = 42/181 (23%), Positives =/ 87/181 (48%), Gaps = 30/181 (16%) 

sgkkicvlisggmdsavatkilqlsnJdvrglhithrwmwftpeieikriskmlgikd-v 67 

S K+ V +SGG+DS+ + L+ /D+ G+ + + + + K++ LGI+ V 

SNIKVGVAMSGGVDSSTVAYLLKQQGf DI FGVTMK TFKDEDSDAKKVCDDLGIEHYV 78 

VDITDELRRRL RGAKGKSf/CKICKK IMLEIAVSK-ASVVATGECGMD 113 

+ D+ + E + + + + J C +C + ML+ + SK AS +ATG 

LDVRNEFKEKVMDYFVNEYMNGRTPMPCMVCNRHIKFGKMLDFILSKGASFMATGH--YT 136 

TIAGAVLDVSRRTGIEPEFV--QLETKRYFNGDDRI — IVRPLIRIHESDVKRLARLLGVK 169 

+ +L V + + + Q+/K ' DR+ 1+ P+. + + + + + LA+ + GV+ 
KLKNGLLSVGDDSNKDQVYFLSQ^feK DRLSKI I FPVGDLEKTKLRELAKQMGVR 190 



Query 


9 


Sbjct 


22 


Query 


68 ■ 


Sbjct 


79 


Query 


114 


Sbjct 


137 


Query 


170 


Sbjct 


191 



V 
V 
V 



170 
191 



I; 



>ref |NP_956881 . 1 1 LLliS guanine monophosphate synthetase [Danio rerio] 



gb| AAH56730 . 1 | LUi£j Guanine monphosphate synthetase [Danio rerio] 
Length=359 

Score =37.4 bits (85), Expect = 0.87, Method: Composition-based stats. 
Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 2/64 (3%) 

Query 4 LRKKVSGKKICVLISGGMDSAVATKILQ — LSNTDVRGLHITHRWMWFTPEIEIKRISKM 61 

+ R+KV K+ VL+SGG+DS V T +L L+ V +HI + +M ++ 
Sb j ct 229 I REKVDKSKVLVLLSGGVDSTVCTALLNKALNQEQVI AVHI DNGFMRKRESQSVEEALTK 288 

Query 62 LGIK 65 
LGIK 

Sbjct 289 LGIK 292 



http://www.ncbi.nlm.nih.gov/BLAST/Blast.cgi 



6/12/2007 




NCBI Blast:Protein sequence(263 letters) i * 

>ref | ZP_01675115.1| tRNA ( 5-methylaminomethyl-2-thiouridylate ) -methyltransf erase 
[Candidatus Desulf ococcus oleovorans Hxd3] 

gb|EAX53590. 1 | tRNA ( 5-methylaminomethyl-2-thiouridylate ) -methyltransf erase 
[Candidatus Desulf ococcus oleovorans Hxd3] 
Length=358 

Score - 37.0 bits (84), Expect =1.0, Method: Composition-based stats. 
Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 3/55 (5%) 

13 ICVLISGGMDSAVATKILQLSNTDVRGLHITHRWMWFTPEIEIKRISKMLGIKDV 67 

.1 VL+SGG+DS VA +L+ S DV +H + PE R+ K+ D+ 

19 I AVLVSGGI DSLVAAHLLKQSGADVTAI HFL TGYEPEDRAGRLEKLFTQMDI 7 0 

>ref |YP_388864.1| E hypothetical protein Dde_2372 [Desulf ovibrio desulfuricans G 

gb| ABB39169. 1 1 ESI conserved hypothetical protein [ Desulf ovibrjy6 desulfuricans G2 
Length=357 ^ 7 

Score =37.0 bits (84), Expect =1.1, Method: Composition-ttased stats. 
Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = (10%) 

Query 16 LISGGMDSAVATKILQLSNTDVRGLHITHRWMWFTPEIEIKRISKMLOIK-DVVDITDEL 7 4 

L+SGG+DS +A K++Q V+ LH + F E + R M >G+ VVD++D+ 

Sbjct 14 LLSGGLDSILAVKVVQEQGLRVKCLHFVTP--FFGKPREVNRWESIWGLDIAVVDVSDDF 71 

Query 7 5 RRRLRG AKGKSVCKICKKIML 95 

+RG K + C CK +M+ 

Sbjct 72 VSMMRGRPAHGYGKVLNPCVDCKILMM 98 

>ref |NP_602720.1| E3 tRNA ( 5-methylaminomethyl-2-thi/ouridylate ) -methyltransf eras 
[Fusobacterium nucleatum subsp. nucleatum ATCC 2558/6] 

gb|AAL94019.1| E£3 tRNA ( 5-methylaminomethyl-2-thl6uridylate ) -methyltransf erase 
[Fusobacterium nucleatum subsp. nucleatum ATCC 25/861 
Length=343 

Score =37.0 bits (84), Expect =1.1, Method:/composition-based stats. 
Identities = 36/114 (31%), Positives = 58/114/(50%), Gaps = 16/114 (14%) 

Query 11 KKICVLISGGMDSAVATKILQLSNTDVRGLHITyHRWMWFTPEIE-IKRISKMLG-IKDVV 68 

KK+ + +SGG+DS+V+ +L+ +V G+ +/ + +IE K++ LG I +VV 

Sbjct 2 KKVVIGMSGGVDSSVSAYLLKEQGYEVIGITyNQHLEENSKDIEDTKKVCDKLGIIHEVV 61 

Query 69 DITDE LRRRLRG AKGKSVCKIc/ KIMLEIAVS-KASVVATG 108 

+ 1 + ++ L G K S C 1(2 KI+ +IA A VATG 

Sbjct 62 NIRKDFENIVIKYFLEGYNSGKTPSPCITCDDEIKFKILFDIADKYNAEYVATG 115 

>ref|ZP 01511681.11 PP-loop [Burkhold/ria phytofirmans PsJN] 

gb|EAVTT3756.1| PP-loop [Burkholderiaf phytofirmans PsJN] 
Length=336 

Score = 37.0 bits (84), Expect = /. 2, Method: Composition-based stats. 
Identities = 44/175 (25%), Positives = 80/175 (45%), Gaps = 28/175 (16%) 

SGKKICVLISGGMDSAVATKILCL SNTDVRGLHITHRWMWFTPEIEIKRISKML 62 

+G K+ V +SGG DS +IZ N D+ +++ + F PE + K L 

NGDKVMVCLSGGKDSYAMLEltMRLRERAPINFDIVAVNLDQKQPGF-PEHVLPEYLKQL 124 

GIKDVVDITD--ELRRRLRGAKGKSVCKICKK IMLEIAVS-KASVVATGECGMDTI 115 

I ++ D + +RL/ +GK+ C +C + 1+ +A A+ +A G D + 
DIPFHIENQDTYSIVKRKV-PEGKTTCSLCSRLRRGILYRVAGELGATKIALGHHRDDIL 183 

AGAVLDV SRRTGI^PEFVQLPKRYFNGDD--RIIVRPLIRIHESDVKRLARL 165 

+L+ + + G4 P+ DD I++RPL + E+D+++ A L 

QTLLLNMFYGGKLKGMPPKL QSDDGKNIVIRPLAYVKETDLEKYAEL 230 

>ref |YP_012451.1| E3 hypothetical protein DVU3241 [ Desulf ovibrio vulgaris subsp. 
str. Hildenborough] 

ref | YP_965598. 1 | E3 PP-loop domain protein [ Desulf ovibrio vulgaris subsp. vulgar 



Query 


9 


Sbjct 


66 


Query 


63 


Sbjct 


125 


Query 


116 


Sbjct 


184 



http://www.ncbi.nlm.nih.gov/BLAST/Blast.cgi 



6/12/2007 



I oil 



NCBI Blast:Protein sequence(263 letters) I . 

>ref |NP_988352.1| S hypothetical protein MMP1232 [Methanococcus maripaludis S2] 

conserved hypothetical protein [Methanococcus maripaludis S2] 



emb|CAF30788.1| 

Length=296 

Score = 136 bits (342), Expect = le-30, Method: Composition-based stats. 
Identities = 90/257 (35%), Positives = 141/257 (54%), Gaps = 27/257 (10%) 



Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 



11 

39 

69 

99 

121 

159 

171 

214 

231 

274 



KKICVLISGGMDSAVATKILQLSNTDVR-GLHITHRWMWFTPEIEIKRISKMLGIKDVV- 
+ KI V++SGG DSAV+ I + J +V + H H+W W E K+ + + + + 

EKIIVMLSGGKDSAVSLAIAKDLGJjNVHLCVHFVHKWSWDISTNEAKKLADRFISIVPIIFP 

DITDELRRRLRGAKGKSVCKICKKtM LEI A- VSKAS VVATGECGMDT I AGAVL 

DIT+EL ++ +GAKGKS+C+ICK CM ++IA V A ++ TGE ++ IAG V 
DITEELAKKTQGAKGKSICRICKTIIMKARMMDIAKVENAKIIMTGETALEKIAGPVFQYI 

DVSRRTGIE PEFVQLPKRYFNGDDRI - I VRPLI RI HESDVKRLARLLGVKV 

V R+ E E ++PKRY +1 RPLIR+ DV L + ' + + 

RENNASVKRKDEFELYKKMEITKVfKRY KIHFFRPLIRVGHFDVFNLQKYYKI DI 



RRVGETGDLRRGRREGCPLQHLDPV 
+RV E G+ REGC LQ+ P 

KRVSEAGNKIGYWREGCCLQYCSP1 

FRIILEGSPEERRHVAE 24 7 

I + PEE + + + + 
KEITV— IPEEEKYLKK 288 



/dvtdelmdevwdvnveallvarrlgrrvsvkwps 
++t el d+++ +n +a +ar g r s+ p+ 
:elttelfddlykinkkateiardcgfrasitlpa 



68 

98 

120 

158 

170 

213 

230 

273 



>ref|YP 001096890.11 □ PP-loop domainl protein [Methanococcus maripaludis C5] 

gb| AB034675. 1 | E3 PP-loop domain protiin [Methanococcus maripaludis C5] 
Length=296 

Score = 133 bits (335), Expect = 9e-l30, Method: Composition-based stats. 
Identities = 88/258 (34%), Positives =142/258 (55%), Gaps = 27/258 (10%) 



Query 


7 


Sbjct 


35 


Query 


66 


Sbjct 


95 


Query 


120 


Sbjct 


155 


Query 


167 


Sbjct 


210 


Query 


227 


Sbjct 


270 



KVSGKKICVLISGGMDSAVATKILQLS 
K+ +KI V+ + SGG DSAV+ + 4- 



klkdekiivmlsggkdsavslavakel(;lnvhlcvhfvhkwswdistneakkladrfnip 



dvv-ditdelrrrlrgakgksvckicke 
+ ++t+el ++ +gakgks+c+ick 



tdvr-glhithrwmwftpeieikriskmlgik 
+v +h h+w w e k+++ " i 



IM LEI AVSK- AS VVATGECGMDT I AGAV 

IM ++IA + A ++ TGE ++ IAG V 



i i fpnvteelakktqgakgks icrick1 imkarmi di akeenaki imtgetaleki agpv 

l dvsrrtgie pefvqi pkryfngddri-ivrplirihesdvkrlarll 

+v r+ e e +h pkry +1 rplir+ dv l + 
fqyirenntnvkrkdefelykkmeitkvpkry kihffrplirvghfdvfnlqkyy 

gvkvrrvgetgdlrrgrregcplqh\dpwvdvtdelmdevwdvnveallvarrlgrrvsv 
+ + + rv e g+ regc lq+ \ 3 + + t el d+++ +n +a +ar g r s + 
nidikrvseagnkigywregcclqyc3lpdcelttelfddlykinkkateiarengfrasi 

kwpsfriilegspeerrh 244 \ / 

P+ I + PEE ++' \l 
TLPAKEITV--IPEEEKY 285 V 



65 

94 

119 

154 

166 

209 

226 

269 



>ref|ZP 01702630.11 thiamine biosynthesis protein [Methanococcus vannielii SB] 

gb|EAY5lll5.1| thiamine biosynthesis protein [Methanococcus vannielii SB] 
Length=299 

Score = 131 bits (329), Expect = 5e-29, Method: Composition-based stats. 
Identities = 87/260 (33%), Positives = 141/260 (54%), Gaps = 27/260 (10%) 

Query 8 VSGKKICVLISGGMDSAVATKILQLSNTDVR-GLHITHRWMWFTPEIEIKRISKMLGIKD 66 

++ +KI V++SGG DSAV+ I + +V +H H W W E K+++ +K 

Sbjct 36 LNNEKIVVMLSGGKDSAVSLAIAKSLGLNVHLCVHFLHDWSWNISTDEAKKLADRFNVKI 95 

Query 67 VV-DITDELRRRLRGAKGKSVCKICKKIM LEIAVSK-ASVVATGECGMDTIAGAVL 120 

+ DIT+EL ++ +GAKGKS+C+ICK IM +EIA + A ++ TGE ++ IAG + 
Sbjct 96 I FPDITEELLKKTQGAKGKSICRICKTIMKARMVEIAKEENAKI IMTGETALEKIAGPI F 155 

Query 121 D VSRRTGIE PEFVQLPKRYFNGDDRI -I VRPLI RIHESDVKRLARLLG 167 

+ + R++ E E ++PKRY +1 RPLIR+ D+ L + 



http://www.ncbi.nlm.nih.gov/BLAST/Blast.cgi 



6/12/2007 



j^JCBI Blast:Protein sequence(263 letters) 

EHIRENNKNIKRKSEQELYSQMELTKVPKRY KIHFFRPLIRVGHFDIFNLQKYYN 



Sbjct 
Query 
Sbjct 
Query 
Sbjct 



156 
168 
211 
228 
271 



VKVRRVGETGDLRRGRREGCPLQHLDPWVDVTDELMDEVWDVNVEALLVARRLGRRVSVK 
+++ RV E G+ REGC LQ+ P H EL D ++ VN A +AR+ G R S+ 

LEINRVNEAGNKIGYWREGCSLQYCSPEAMISKELFDNLYVVNKLATEIARKNGFRASIV 

WPSFRI ILEGSPEERRHVAE 24 7 

P I + P + + +A+ 
LPEKEITV--IPNTKENIAK 288 



210 
227 
270 



>ref |NP 248010.11 □ hypothetical p/ 
2661] " / 
sp|Q58422|Y1016 METJA Hypothetical 



otein MJ1016 [Methanocaldococcus jannaschii D 



protein MJ1016 

gb | AAB99022.il E hypothetical protein MJ 1016 [Methanocaldococcus jannaschii DS 
2661] 

Length=294 



Score = 127 bits (318), Expect = 
Identities = 81/243 (33%), Positive 



9e-28, Method: Composition-based stats, 
= 133/243 (54%), Gaps = 14/243 (5%) 



Query 


13 


Sbjct 


37 


Query 


71 


Sbjct 


97 


Query 


126 


Sbjct 


157 


Query 


179 


Sbjct 


217 


Query 


239 


Sbjct 


277 



ICVLISGGMDSAVATKILQLSNTE VRGL 
I V+ SGG DS+ A + + 4 + 

IVVMASGGKDSSTAIALAKDLGLMEYLIHFYHRWSWDVSKKMVEKLSKKFDI 



TDELRRRLRGAKGKSVCKICKKIiy LEIAVSKAS 



TDEL +R +GAKG S+C+ICK IN 



TDELLKRTKGAKGSSICRICKNINKDKAVDISKEKGI 



PEE 241 

+ E 

KDE 279 



>ref|ZP 01799296.11 conserved hypotftet 
gb|EDKT6291.1| conserved hypothetical 
Length=302 



Score - 125 bits (314), Expect = 
Identities = 82/239 (34%), Positive 



Query 


11 


Sbjct 


37 


Query 


69 


Sbjct 


97 


Query 


122 


Sbjct 


157 


Query 


177 


Sbjct 


214 



-HITHRWMWFTPEIEIKRISKMLGIKDVV-DI 
L H HRW W + ++++SK I + +1 

PVIFYNI 



+ AV 



70 
96 
125 



-VVATGECGMDTIAGAVLDVSRR 
++ TG+ ++ ++GAV++ R 
[ RI IMTGDSALEKVSGAVMNYLRD 156 



TGIEP EFVQLPKRYFNGDDjR 

E E + P++Y G Df 

VYGEVVYNKMELTPVPQKYSKGKDKEVLFFRPLIRLACEDVLKLMDYYNIEIERAHEVGD 



--IIVRPLIRIHESDVKRLARLLGVKVRRVGETGD 178 
+ RPLIR+ DV +L +++ R E GD 

216 



LRRGRREGCPLQHLDPWVDVTDEL^DEVWDVNVEALLVARRLGRRVSVKWPS 

REGC LQ+ D + ++L 
KIGFWREGCCLQYADENALLNEKL 



FRIILEGS 238 
+E++ N A VA++ G R S+K PS +I++ 
NELYKYNKIATEVAKKHGFRASIKLPSKKIMVVPK 27 6 



ical protein [Methanococcus aeolicus Nanka 
protein [Methanococcus aeolicus Nankai-3] 



e-27, Method: Composition-based stats. 
= 129/239 (53%), Gaps = 17/239 (7%) 



KKI CVLI SGGMDS AVATKI LQLSNTPVR 
K I + + SGG DS+ A + + 

KPIIAMLSGGKDSSTALAMAKDLDLblKLAVHFVHKWSWELPKTQVQKI 



GLHITHRWMWFTPEIEIKRISKMLGIKDVV- 
H+ +H H+W W P+ ++++I+K + + 

TKEYDVPLIFY 



DITDELRRRLRGAK--GKSVCKICKflCIM 
DITD L +R++G GK++CKICK 
DITDVLCQRIQGGHSWGKNICKICK' 



VSRRTG IEPEFVQLPKRYFNCfDDRI 

T + E + PK+Y N 

YVNETYGSYVYSKMELTPVPKKYRNlK 



G+ 



REGC LQ+ + 



GNKIGYWREGCCLQYCNSDTKLNTELFDDLYYYNNRLTALARKHGFRAS 



>ref |NP_126247. 1 1 [£3 hypothetical prq :ein PAB1992 [Pyrococcus abyssi GE5] 



-LEIAV-SKASVVATGECGMDTIAGAVLD 
IM EIA KA + TGE ++ I AG + 

IMKEKIREIAYKEKAYTILTGESALEKIAGPIFQ 



68 
96 
121 
156 
176 



IVRPLIRIHESDVKRLARLLGVKVRRVGET 
RPLIR +D++ L + VK+ RVGE 
- - FFRPLI RCGCNDI EALQKH YNVKI ERVGEV 213 



GDLRRGRREGCPLQHLDPWVDVTDE] .MDEVWDVNVEALLVARRLGRRVSVKWPS 



+ EL D+ + + N 



FRIIL 

+AR+ G R S+K PS +1+ 
IKMPSKEVII 



235 
272 



Predicted ATPase J>f the PP-loop super family [Pyrococcus abyss 



emb|CAB49478.1| 

GE5] 

Length=260 

Score = 110 bits (274), Expect = le-22, Method: Composition-based stats. 
Identities = 86/233 (36%), Positives = 130/233 (55%), Gaps = 22/233 (9%) 



http://www.ncbi.nlm.nih.gov/BLAST/Blast.cgi 



6/12/2007 



NCBI Blast:Proteiii sequence(263 letters) 



0} 



ref|Z F 01562555,11 PP-loop [Burkholderia cenocepacia MCO-3] >. . 

ref I YP_836621 . 1 1 PP-loop domain protein [Burkholderia cenocep. . 
ref 1 ZP _G 0990938. 1 | COG0037 : Predicted ATPase of the PP-loop s.. 

ref |XP 0013 5 7491. 11 GA21637-PA [Drosophila pseudoobscura] >gb. . 

ref I ZP 015181 42. 1 1 PP-loop [Comamonas testosteroni KF-1] >gb|.. 
gb'| ABK7 8113. 1 1 tRNA (•5-methylaminomethyl-2-thiouridylate ) met.. 

Alignments 



Bagc 5 of !R3 

p- 1 



35.0 

34.7 

34.7 
34.77 
34 77 



>ref |NP_614974.1| Predicted ATPase of the tfP-loop superfamily implicated in ce 
cycle control [Methanopyrus kandleri AV19] / 

gb | AAM02904.il B Predicted ATPase of the Pg-loop superfamily implicated in cell 
cycle control [Methanopyrus kandleri AV19] 
Length=264 

Score = 481 bits (1239), Expect = le-134,/ Method: Composition-based stats, 
Identities = 262/264 (99%), Positives = 263/264 (99%), Gaps = 1/264 (0%) 

LELLRKKVSGKKICVLISGGMDSAVATKILQLSNTDVRGLHITHRWMWFTPEIEIKRISK 60 
+ELLRKKVSGKKICVLISGGMDSAVATKILQLSNTDVRGLHITHRWMWFTPEIEIKRISK 
MELLRKKVSGKKICVLISGGMDSAVATKILpLSNTDVRGLHITHRWMWFTPEIEIKRISK 60 

MLGIK-DVVDITDELRRRLRGAKGKSVCKICKKIMLEIAVSKASVVATGECGMDTIAGAV 119 
MLGIK DVVDITDELRRRLRGAKGKSVCKICKKIMLEIAVSKASVVATGECGMDTIAGAV 
MLGIKIDVVDITDELRRRLRGAKGKSVCMECKKIMLEIAVSKASVVATGECGMDTIAGAV 120 

LDVSRRTGIEPEFVQLPKRYFNGDDRIIfRPLIRIHESDVKRLARLLGVKVRRVGETGDL 17 9 
LDVSRRTGIEPEFVQLPKRYFNGDDRI IA/RPLIRIHESDVKRLARLLGVKVRRVGETGDL 
LDVSRRTGIEPEFVQLPKRYFNGDDRI yVRPLIRIHESDVKRLARLLGVKVRRVGETGDL 180 

RRGRREGCPLQHLDPWVDVTDELMDEVWDVNVEALLVARRLGRRVSVKWPSFRIILEGSP 239 
RRGRREGCPLQHLDPWVDVTDELMDEWDVNVEALLVARRLGRRVSVKWPSFRIILEGSP 
RRGRREGCPLQHLDPWVDVTDELMDEVWDVNVEALLVARRLGRRVSVKWPSFRIILEGSP 24 0 

EERRHVAECVWYRWVRAGRPRRYR k 63 
EERRHVAECVWYRWVRAGRPRRYR 
EERRHVAECVWYRWVRAGRPRRYR! /264 



Query 


1 


Sbjct 


1 


Query 


61 


Sbjct 


61 


Query 


120 


Sbjct 


121 


Query 


180 


Sbjct 


181 


Query 


240 


Sbjct 


241 



>ref|ZP 01800055.11 PP-loop domain protein [Methanococcus maripaludis C7] 

gb|EDKT7850.1| PP-loop domain protein [Methanococcus maripaludis C7] 
Length=296 

Score = 137 bits (344), Expect = 7e-31, Method: Composition-based stats. 
Identities = 89/257 (34%), Positives = 141/257 (54%), Gaps = 27/257 (10%) 

KKICVLISGGMDSAVATKILQLSNTDVR-GLHITHRWMWFTPEIEIKRISKMLGIKDVV- 68 
+KI V++SGG DSAVA I + +V +H H+W W E K+++ + + 

EKIVVMLSGGKDSAVALAIAKDLGLNVHLCVHFVHKWSWDISTNEAKKLADRFNVPIIFP 98 

DITDELRRRLRGAKGKSV'CKICKKIM LEIAVSK-ASVVATGECGMDTIAGAVLDVS 123 

DIT+EL + + +GAKGKS+C+ICK IM + + IA + A ++ TGE + + I AG V 
DITEELAKKTQGAKGKSICRICKTIMKARMIDIAKEENAKIIMTGETALEKIAGPVFQYM 158 

RRTGIE PEFVQLPKRYFNGDDRI-IVRPLIRIHESDVKRLARLLGVKV 170 

R + E ++PKRY +1 RPLIR+ DV L + + + 

RENNSKLNRKDEFELYKKMELTKVPKRY KIHFFRPLIRVGHFDVFNLQKHYKIDI 213 

RRVGETGDLRRGRREGCPLQHLDPWVDVTDELMDEVWDVNVEALLVARRLGRRVSVKWPS 230 
+ RV E G+ REGC LQ+ P ++T EL D+++D+N +A +AR+ G R S+ P+ 

KRVSEAGNKIGYWREGCCLQYCSPTCELTTELFDDLYDLNKKATEIARKNGFRASITLPT 273 

FRI ILEGSPEERRHVAE 24 7 

I + PEE + + + + 
KEITV— IPEEEKYLKK 288 
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LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



EAW85671 1169 aa . linear PRI 18-DEC-2006 

BAIl-associated protein 3, isoform CRA_b [Homo sapiens] . 
EAW85 671 

EAW85671.1 GI:119606077 
accession CH471112 .2 

Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (residues 1 to 1169) 

Venter, J. C, Adams, M.D., Myers, E.W., Li,P.W., Mural, R. J., 
Sutton, t G.G. , Smith, H.O., Yandell,M., Evans, C. A., Holt, R. A., 
Gocayne, J.D. , Amanatides, P. , Ballew,R.M., Huson,D.H., Wortman, J. R . , 
Zhang, Q., Kodira,C.D., Zheng, X.H., Chen,L., Skupski,M., 
Subramanian,G. , Thomas, P. D., Zhang, J. , Gabor Miklos,G.L., 
Nelson, C, Broder,S., Clark, A. G., Nadeau,J., McKusick, V . A . , 
Zinder,N., Levine,A.J., Roberts , R . J. , Simon, M., Slayman,C, 
Hunkapiller,M. , Bolanos,R., Delcher,A., Dew, I . , Fasulo,D., 
Flanigan,M. , » Florea, L. , Halpern, A. , Hannenhalli , S . , Kravitz, S . , 
Levy,S., Mobarry,C, Reinert,K., Remington, K . , Abu-Threideh, J . , 
Beasley,E., Biddick,K., Bonazzi,V., Brandon,R., Cargill,M., 
Chandramouliswaran, I . , Charlab,R., Chaturvedi, K. , Deng,Z., Di 
Francesco, V. , Dunn,P., Eilbeck,K., Evangelista, C . , Gabrielian, A . E . , 
Gan,W., Ge,W., Gong,F., Gu,Z., Guan,P., Heiman,T.J., Higgins, M . E . , 
Ji,R.R., Ke,Z., Ketchum, K.A. , Lai,Z., Lei,Y., Li,Z., Li, J., 
Liang, Y., Lin,X., Lu,F., Merkulov, G . V . , Milshina,N., Moore, H.M., 
Naik,A.K., Narayan, V . A . , Neelam,B., Nusskern,D., Rusch,D.B., 
Salzberg,S., Shao,W., Shue,B., Sun, J., Wang,Z., Wang, A., Wang,X., 
Wang, J., Wei,M., Wides,R., Xiao,C, Yan,C, Yao, A . , Ye, J. , Zhan,M., 
Zhang, W., Zhang, H., Zhao,Q., Zheng, L., Zhong,F., Zhong,W., Zhu,S., 
Zhao,S., Gilbert, D., Baumhueter , S . , Spier, G., Carter, C, 
Cravchik,A., Woodage,T., Ali , F . , An, H . , Awe, A., Baldwin, D., 
Baden, H., Barnstead, M. , Barrow, I., Beeson,K., Busam,D., Carver, A., 
Center, A., Cheng, M.L., Curry, L., Danaher,S., Davenport , L . , 
Desilets,R., Dietz,S., Dodson,K., Doup,L., Ferriera,S., Garg,N., 
Gluecksmann, A. , Hart,B., Haynes,J., Haynes,C, Heiner,C, 
Hladun,S., Hostin,D., Houck,J., Howland, T., Ibegwam,C, Johnson, J., 
Kalush,F., Kline, L., Koduru,S., Love, A., Mann,F., May,D., 
McCawley,S., Mcintosh, T., McMullen,I., Moy , M . , Moy , L . , Murphy, B., 
Nelson, K., Pfannkoch,C. , Pratts,E., Puri,V., Qureshi,H., 
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Reardon,M., Rodriguez, R. , Rogers, Y.H., Romblad,D., Ruhfel,B., 
Scott, R., Sitter, C, Smallwood, M. , Stewart, E., Strong, R., Suh,E., 
Thomas, R., Tint,N.N., Tse,S., Vech,C, Wang,G., Wetter, J. , 
Williams, S., Williams,M., Windsor,S., Winn-Deen, E . , Wolfe, K., 
Zaveri,J., Zaveri,K., Abril,J.F., Guigo,R., Campbell, M . J. , 
Sjolander,K.V. , Karlak,B., Ke j ariwal , A . , Mi , H . , Lazareva,B., 
Hatton,T., Narechania, A . , Diemer,K., Muruganu j an, A . , Guo,N., 
Sato,S., Bafna,V., Istrail,S., Lippert,R., Schwartz, R., Walenz,B., 
Yooseph,S., Allen, D., Basu,A., Baxendale, J. , Blick,L., Caminha,M., 
Carnes-Stine, J. , Caulk, P., Chiang, Y.H., Coyne, M., Dahlke,C, 
Mays, A., Dombroski , M . , Donnelly, M., Ely, D . , Esparham,S., Fosler,C, 
Gire,H., Glanowski, S . , Glasser,K., Glodek,A., Gorokhov,M., 
Graham, K., Gropman,B., Harris, M., Heil,J., Henderson, S . , Hoover, J., 
Jennings, D . , Jordan, C, Jordan, J., Kasha, J. , Kagan,L., Kraft, C, 
Levitsky,A., Lewis, M., Liu,X., Lopez, J., Ma, D . , Majoros,W., 
McDaniel,J., Murphy, S., Newman, M . , Nguyen, T . , Nguyen, N., Nodell,M., 
Pan,S., Peck,J., Peterson, M., Rowe,W., Sanders,R., Scott, J., 
Simpson, M., Smith, T., Sprague,A., Stockwell , T . , Turner, R., 
Venter, E., Wang,M., Wen,M., Wu,D., Wu,M., Xia,A., 2andieh,A. and 
Zhu,X. 

TITLE The sequence of the human genome 

JOURNAL Science 291 (5507), 1304-1351 (2001) 
PUBMED 1118 1995 
REFERENCE 2 (residues 1 to 1169) 

AUTHORS Mural, R.J., Istrail,S., Sutton, G., Florea,L., Halpern, A . L .■, 
Mobarry, CM. , Lippert,R., Walenz,B., Shatkay,H., Dew, I . , 
Miller, J.R., Flanigan, M . J. , Edwards, N . J. , Bolanos,R., Fasulo,D., 
Halldorsson,B.V. , Hannenhalli , S . , Turner, R., Yooseph,S., Lu,F., 
Nusskern, D.R. , Shue,B.C, Zheng, X.H., Zhong,F., Delcher, A. L . , 
Huson,D.H., Kravitz,S.A., Mouchard,L., Reinert,K., Remington, K . A. , 
Clark, A. G. , Waterman, M . S . , Eichler,E.E. , Adams, M.D. , 
Hunkapiller,M.W. , Myers, E.W. and Venter, J.C. 

TITLE Direct Submission 

JOURNAL Submitted ( 05- JUL-2005 ) Celera Genomics, 45 W. Gude Dr., Rockville, 
MD 20850, USA 
REFERENCE 3 (residues 1 to 1169) 

AUTHORS Mural, R.J., Istrail,S., Sutton, G., Florea,L., Halpern, A . L . , 
Mobarry, CM. , Lippert,R., Walenz,B., Shatkay,H., Dew, I . , 
Miller, J.R., Flanigan, M . J. , Edwards, N . J. , Bolanos,R., Fasulo,D., 
Halldorsson, B . V . , Hannenhalli , S . , Turner, R., Yooseph,S., Lu,F., 
Nusskern, D.R. , Shue,B.C, Zheng, X.H., Zhong,F., Delcher , A . L . , 
Huson,D.H., Kravitz, S.A. , Mouchard,L., Reinert,K., Remington, K. A. , 
Clark, A. G. , Waterman, M . S . , Eichler,E.E. , Adams, M.D. , 
Hunkapiller,M.W. , Myers, E.W. and Venter, J.C. 

TITLE Direct Submission 

JOURNAL- Submitted ( 02-SEP-2005 ) Celera Genomics, 45 W. Gude Dr., Rockville, 
MD 20850, USA 
COMMENT Method: conceptual translation. 

FEATURES > Location/Qualifiers 
source 1 . .1169 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/chromosome=" 16" 
Protein 1 . . 1169 

/product="BAIl-associated protein 3, isoform CRA_b" 
Regio n 179. .34 9 

/region_name="C2 " 

/note="Protein kinase C conserved region 2 (CalB) ; 
Ca2+-binding motif present in phospholipases , protein 
kinases C, and synaptotagmins (among others) . Some do not 
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appear to contain Ca2+-binding sites; cd00030 M 
/db xref="CDD: 28912 " 
Region 543.. 624 

/region_name=" DUF104 1 11 

/note="Domain of Unknown Function (DUF1041). This family 
consists of several eukaryotic domains of unknown 
function. Members of this family are often found in tandem 
repeats and co-occur with pfam00168, pfam00130 and 
pfam00169 domains; pfam06292" 
/db xref="CDD: 69796 " 
Region 994.. 1096 

/region_name="C2_l" 

/note="Protein kinase C conserved region 2, subgroup 1; C2 
Ca2+-binding motif present in phospholipases, protein 
kinases C, and synaptotagmins (amongst others); some PKCs 
lack calcium dependence; cd00276" 
/db xref="CDD: 2934 4" 
CDS 1. .1169 

/gene="BAIAP3" 

/ locus_tag="hCG_4 2 689" 

/coded_by= ,? join(CH471112. 2: 1318504 . . 1318 634, 
CH471112. 2: 1318856. . 1318943, CH471112 . 2 : 1319067 . . 1319198, 
CH471112. 2: 1319450. . 1319557, CH471112 . 2 : 1320688 . .1320747, 
CH471 112. 2: 1320895. . 1320961 , CH471112 . 2 : 1321060 . .1321255, 
CH471 112. 2: 1321588. . 1321 632 , CH471112 . 2 : 1321752 . .1321829, 
CH471112. 2:1321947. . 132204 9, CH4 7 1 1 12 . 2 : 13222 93 . .1322400, 
CH471112. 2:1322485. . 1322580, CH471112 . 2 : 1322 679 . .1322782, 
CH47 1112. 2: 1323159. . 1323279, CH471112 . 2 : 1323788 . .1323912, 
CH471112. 2: 1323993. . 1324 083, CH471112 . 2 : 1324235 . .1324291, 
CH471112. 2: 1324367. . 1324 4 64 , CH471112 . 2 : 1324 564 . . 1324 615, 
CH471112. 2: 1324735. . 1324 865 , CH471112 . 2 : 1325015 . . 1325138, 
CH471112. 2: 1325513. . 1325656, CH47 1112 . 2 : 1325752 . .1325844, 
CH471112. 2:1325936. . 1326127, CH471112 . 2 : 1326208 . .1326276, 
CH471112. 2: 1326371. . 13264 50, CH471112 . 2 : 1326613 . .1326721, 
CH471112. 2: 1327064. . 1327135, CH471112 . 2 : 1327311 . .1327391, 
CH471112.2:13274 67. . 1327587 , CH471112 . 2 : 1327 660 . .1327783, 
CH471112.2:13278 62. . 1328067 , CH471112 . 2 : 1328 14 5 . .132824 8) " 
/note="gene_id=hCG42 689.2 transcript_id-hCT2356688 . 0 
protein_id=hCP1921942 . 0 isof orm=CRA_b" 
/db_xref ="GeneID : 8938 " 

ORIGIN 

1 mstlldikss vlrqvqvcps frrrteqdpg sasadpqepa tgawkpgdgv 
61 kkgegrqglp cleawgspcr qsphpgthtq vplrsgspap pepvdpslgl 
121 lyeealytvl yragtmgpdq vddeeallsy lqqvfgtsle ehteaiervr 
181 vsvmraknll akdpngfsdp ycmlgilpas datrepraqk eqrfgfrkgs 
241 iqvtevksst lnpvwkehfl feiedvstdq lhldiwdhdd dvslveacrk 
301 gryfkqivks arangtagpt edhtddflgc lnipvrevpv agvdrwfkle 
361 hchlvlklit tqrdtamsqr grsgflshll llshllrleh saeepnsssw 
421 ilclhgaqsn lsplqlavlh wqvssrhhqt ctldysyllg lledmqahwe 
481 esladslsaf sefglqllrq lrdyfpatns tavhrlelll kclgklqlfq 
541 elnmdiaaal krgnrewydr ilndkspreq pgpqrlpglv vladavyddl 
601 fhsilnvdvf tltfrqlerl vaeeawvlte elspkmtlev asglfelylt 
661 ipgrdsrsla lagihapflp avklwfqvlr dqakwrlqga vdmdtlepvd 
721 taglclshiq elwvrlawpd paqaqglgtq lgqdvceatl fytellrkkv 
781 vsealcvvln nvelvrkaag qalkglawpe gatgpegvlp rpllsctqal 
841 vtahltskmv gdirkyvqhi slspdsiqnd eavaplmkyl deklallnas 
901 ealwelllqa ilqalganrd vsadfysrfh ftlealvsff haegqglple 
• 961 keelrlhkcs trecieqfyl dklkqrtleq. nrfgrlsvrc hyeaaeqrla 
1021 pldanglsdp fvivelgpph lfplvrsqrt qvktrtlhpv ydelfyfsvp 
1081 vlftvmdhdw lstndfagea alglggvtgv arpqvgggar agqpvtlhlc 



ef f ahmrlml 
ralapeevem 
kakaptyalk 
krggplpakc 
lneviglkgm 
prssasrvqg 
rgelstpaat 
eapslpqeqe 
psf eicpf es 
qf cysvyasl 
ladlqrfwds 
assrhsssaa 
dtqpgaagea 
dddlqreaht 
lvkgnlsrvl 
slrdgsykrl 
vevlhaadll 
aeacrrraac 
rpraqvrsal 
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